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A CONSISTENT MARKOV PARTITION PROCESS GENERATED 
FROM THE PAINTBOX PROCESS 
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Abstract 

We study a family of Markov processes on P^''' , the space of partitions of the 
natural numbers with at most k blocks. The process can be constructed from a 



CLj ' Poisson point process on R+ x ni=i ^ with intensity dt^gi, , where g^ is the 

^ . distribution of the paintbox based on the probability measure v on Vm, the set 

-4— > . 
Cy ■ of ranked-mass partitions of 1, and q\j is the product measure on Y\i=i P ■ 

We show that these processes possess a unique stationary measure, and we 

discuss a particular set of reversible processes for which transition probabilities 

^ ' can be written down explicitly. 
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1. Introduction 

Markov processes on the space of partitions appear in a variety of situations in 
scientific literature, such as, but not limited to, physical chemistry, astronomy, and 
population genetics. See Aldous [1^ for a relatively recent overview of this hterature. 
Well-behaved mathematically tractable models of random partitions are of interest 
to probabilists as well as statisticians and scientists, [10] , [H] , [S] , [S] ■ Ewens [TU] first 
introduced the Ewens sampling formula in the context of theoretical population biology. 
Kingman's [12j coalescent model was introduced as a model for population genetics, 
still its most natural setting. However, since the seminal work of Ewens and Kingman, 
random partitions have appeared in areas ranging from classification models, as in 
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2 Harry Crane 

H; [IS]: to probability theory, see [3],[I7]. McCuUagh [T3] describes how the Ewens 
model can be used in the classical problem of estimating the number of unseen species, 
introduced by Fisher ill; and later studied by many, including Efron and Thisted [9]. 

Berestycki [2] studies a family of partition processes, called exchangeable fragmentation- 
coalescence (EFC) processes, whose paths are generated by a combination of inde- 
pendent coalescent and fragmentation processes. The mathematical tractability of 
coalescent and fragmentation processes has led to the development of many results for 
EFC processes and has led to interest in more complex models. For a sample of these 
results and relevant references see [3] , [15] , [17] . The study of processes, such as the EFC 
process, which admit a more general study of partition- valued processes is of interest 
from a theoretical as well as applied perspective. In this paper, we study a family of 
processes which is similar in spirit to the EFC process, but whose sample paths are 
quite different. 

2. Preliminaries 

Throughout this paper, V denotes the space of set partitions of the natural numbers 
N. We regard an element B oiV as a. collection of disjoint non-empty subsets, called 
blocks, written B = {Bi^B2, ■ ■ ■}, such that IJi ^i ~ -^^ The blocks are unordered, 
but, where necessary, they are listed in the order of their least element. We write 
B = {Bi, i?2, . • ■) whenever we wish to emphasize that blocks are listed in a particular 
order. For B ^ V and b E B, i^B is the number of blocks of B and #6 is the 
number of elements of b. For any A C N, let i3|^ denote the restriction of B to A. 
Wherever necessary, V'^'^^ denotes the space of partitions of N with at most k blocks, 
i.e. 'P^'^) := {B eP : #B < k}. For fixed n e N, V[,^] and v''^^ are the restriction to 
[n] := {1, . . . , n} of V and 'P'*^^ respectively. 

It is sometimes convenient to regard a partition B as either an equivalence relation 
defined by B{i,j) — I -i^ i ^b j or an n x n symmetric Boolean matrix whose 
(i,j)th entry is B(i,j). These three representations are equivalent and we use the 
same notation to refer to any one of them. 

For each tt, tt' G V, we define the metric d : V x P ~> M. such that 

d{TT,TT') = l/max{n £ N : 7r|[„] = 7r|[„j}. 
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The space {V,d) is compact [S]. 

In addition, we define the projection Dm,n '■ T^[n] ~^ T^[m] ^oi each n > ?7i > 1 by 
Dm.nBin] = ^[n]|[m]- In the matrix representation, Dm.nB is the leading m x m sub- 
matrix of B. We seek processes B :— {Bt,t > 0) on P such that for each n e N, the 
restriction of B to [n], i?|[„i, is finitely exchangeable and consistent. That is, 

• fT(i?|[„i) ~ -S|[n] ioT each a £ Sn, the symmetric group acting on [n], and 

• -B[„]|[,„] -- S|[„j] for each m < n. 

It is more convenient to work with V as the state space of our process than the space 
'Pm — {(si,S2,...) : si > S2 > . . . > 0, J2i^i — 1} of rankcd-mass partitions of 
X e [0, 1]. In accordance with the notation for set partitions, let Vm '■= {s £ Vin : 
Sj = Vj > k, X]i=i ^i ~ 1} denote the ranked /c-simplex. There is an intimate 
relationship between exchangeable processes on V and processes on Vm through the 
paintbox process. 

For s S "Pm, let X := (Xi,X2, . . .) be independent random variables with distribu- 
tion 

0, o.w. 

The partition Il{X) generated by s through X satisfies i ^u{X} 3 if and only if X,; = Xj. 
The distribution oiIl{X) is written Qs and n(X) is called the paintbox based on s. For 
a measure v on Vm, the paintbox based on ly is the i/- mixture of paintboxes, written 
Qi'i') '■— J-p Qsi')i^ids). Any partition obtained in this way is an exchangeable random 
partition of N and every infinitely exchangeable partition admits a representation as 
the paintbox generated by some v. See and [ITj for more details on the paintbox 
process. 

We are particularly interested in exchangeable Markovian transition probabilities 
(pn), where, for every n, p„ is a transition probability on V[n] which satisfies 

Pn{B,B')= Yl Pn+l{B*,B"), (2.1) 

S"er',;.'„+i(i3') 

for each B,B' e V[n] and B* e D:;^^„^^{B). Burke and Rosenblatt 8^ show that ([^11 
is necessary and sufficient for {p„) to be consistent under selection from N. 
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Likewise, for a continuous-time Markov process, {Bn{t),t > 0)neN, where Bn(t) is 
a process on Vin] with infinitesimal generator Qn, it is sufficient that the entries of 
Qn satisfy (|2.ip for there to be a Markov process on P with those finite-dimensional 
transition rates. 

3. The ^j^-Markov chain on pC^) 

(k) 

Let n, fc e N and let i^ be a probability measure on the ranked fc-simplex Vm , 
so that the paintbox based on v is obtained by a conditionally i.i.d. sample from v, 
i.e. given s ^ v, Xi, X2, . . . are i.i.d. with Fs{Xi = j) = Sj for each j = 1, . . . , fc. For 
convenience, we write B G V^''^ as an ordered list {Bi, . . . , Bk) where Bi corresponds to 
the ith block of B in order of appearance for i < #5 and S^ = for z = ^B + 1, . . . ,k. 

Consider the following Markov transition operation B ^^ B' on 7''-'^-'. Let B = 
{Bi, . . . , Bk) € V^''^ and, independently of B, generate Ci, C2, . . . which are indepen- 
dent and identically distributed accord to p,y . For each i, we write Ci :— {Cn, . . . ,Cik) S 
p'-''\ Independently of B, Ci, C2, . . . , generate cti, 1T2, • ■ • which are independent uni- 
form random permutations of [fc]. Given a := (ci, (72, . . . , Ufc), we arrange B,Ci, . . . ,Ck 
in matrix form as follows: 

C.i C.2 . . . C.k 

Bi / Ci^cri(i) ni?i Ci^o.j(2) ni?i ... Ci^o-i(fc) ni3i 

B-2 ^2^^2(1) ni?2 C'2,o-2(2) ni?2 •■• C2^cr2(k)^B2 

=: B n C . 

Bk \ CkMkii) ^ Bk Cfe^o-fc(2) n i?fe ... Cfc^o-fc(fc) n Bk J 

B n C^ is a matrix with row totals corresponding to the blocks of B and column totals 
C'.j = Ui=i(^j,<Ti(j) ri-Bi). Finally, B' is obtained as the collection of non-empty blocks 
of (C.i, . . • , C.k)- The non-empty entries of B n C^ form a partition in V'^^ -* which 
corresponds to the greatest lower bound BAB'. 

Proposition 3.1. The above description gives rise to finite- dimensional transition 
probabilities on Vi i 

Vn{B. B'- V) = ^^- n fc, " g^(^|J- (3-1) 
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Proof. Let A £ V^'^K Fix n, fc e N, put B := yl|[„] e P^^^^j^ Let Ci, . . . , Cfe be i.i.d. 
Pi,-distributed partitions and a := (ai, . . . , ak) i.i.d. uniform random permutations of 
[k] as described above. Let B' be the set partition obtained from the column totals of 
the matrix B n C"^ in the above construction. 

From the matrix construction, it is clear that for each i = 1, . . . , fc, the restriction 
B|g. is equal to the set partition in V^J associated with Ci[i?i] :— {Cn HBi,. . . , du n 
Bi). Conversely, the transition B i-^ B' occurs only if the collection (Ci, . . . ,Ck) is 
such that, for each Bi E B, Ci[Bi] = B'.g . By consistency of the paintbox process, for 
each i = 1, . . . , fc, Ci[Bi\ has probability 

Independence of the Ci implies that the probability oi B A B' given B is 

n oABi,). 

beB 

Finally, each uniform permutation ai has probability 1/fc! and there are ^, _^„,^| Jibes ('''^ 
^BLy. collections cti, . . . ,cr#s such that the column totals of i? fl C"' correspond to 
the blocks of B' . This completes the proof. 

For fixed n, p.l|) only depends on B and B' through g^, and the number of blocks 
of B and B' and is, therefore, finitely exchangeable. I appeal to (|2.ip to establish 
consistency. 

Proposition 3.2. For any measure v on Vm , let {Pn{': 'l'^))n>i be the collection of 
transition probabilities on 7^r„i defined in (j3.ip . Then (p„) is a consistent family of 
transition probabilities. 

Proof. Fix n, fc G N and let B,B' G 'PiJ ■ To establish consistency it is enough to 
verify condition (|2.ip from theorem 1 of [S], i.e. for each v and B* G D^\^j^{B), 

Pr,+i{B*,D-X+i(B'); ^) = Pn{B, B'; I.). 

We assume without loss of generality that B* G D^^{B) is obtained from B by the 
operation n + 1 n> _Bi G -B and we write BI := Bi L) {n + 1}. Likewise, for B" G 
D~^^_^_^{B') obtained hy n + 1 ^ B^ e B' U {0}, write Bf := B^ U {n + 1}. So either 
n + 1 G B'j* for some i = 1, . . . , ^B' or n + 1 is inserted in B' as a singleton. 
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The change to B n C"^ that results from inserting n + 1 into Bi <E B and B'^ e B' is 
summarized by the following matrix. Note that i?' = for j > #i?'. 



B[ 



B' 



Bl 
B2 



f B[r\Bi B'^n Bl 
B[ n B2 B'^n B2 



Bk \yB[r\ Bk B'2 n Bk 



B'* 
(B^nBi)U{n+l} 

B[ n B2 
B'.nBk 



... Bl 

... BlnBi\ 

... B',nB2 

... B'.nBk) 



Here, the blocks of B are listed in any order, with empty sets inserted as needed, and 
the blocks of B' are listed in order of least elements, with k — #i3' empty sets at the 
end. 

Given B' , the set of compatible partitions Dj7n+i(^') consists of three types de- 
pending on the subset Bi C [n] and the block of B' into which {n + 1} is inserted. Let 
B" e I?,Yn+i(^') be the partition of [n + 1] obtained by inserting n + 1 in B' . Either 



(i) n + 1 is inserted into a block B^ such that B,' n Bi ^ ^ #B'.'g, = #B[ 



\Bi 



(ii) n + 1 is inserted into a block B'^ ^ such that B[C\B\ 



or 



=> #B[^. -#B[^^+1, 



(iii) n + 1 is inserted into B' as a singleton block => #Bj'^, — H^B'.g + 1 and #-B'' 
#5' + 1; we denote this partition by BL. 



There are k — ^B' empty columns in which {n + 1} can be inserted as a singleton in 
B' , as in (iii). For B" obtained by (ii), the restriction of B" to Bl coincides with the 
restriction of B^^ to B^, so each of these restrictions has the same probability under 
g^. For notational convenience in the following calculation, let T>i be those elements 
of D^\_^_^{B') which satisfy condition (i) above and V2 those which satisfy condition 
(ii). 
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p„+i(B*,D„,^„+i(B');' 

E 



k\ 



n 



(fe - #5")! 



(fc - #B'{,) 
k\ 



■Q.{B'{, 



(3.2) 



A;! 



{k~#B') 



n 



(fc - #gf. 

fe! 



E n ^"(^10+ 



+ E i-h^ n ^^(^i^)+i^r^ n ^^(^^ 



k-#B,„ 



fc! 



{k~#B') 



n 



(fc - #gf. 

fe! 



■ n S,AB\, 

beB'-.bytBl 

k~#B' 



Bi beB* 



(3.3) 



E o.{bI'bO+ 



s"e-Di 



E^ f r>'i \ , '^ ~ tF-" ^d' 



B"eT>2 
k\ 



{k-#B')\ 

k\ 

{k~#B') 



11 fe^ 11 ^-(-^1" 

beS ■ b£B:b^B^ 

{k-#B'\^ 



n 

bes 



fc! 



66S:b/S* 



E O-^iBiB-*) + Qi'iB'mBl) 
B"£T>-i 

E ^''(^") 



fc! 



{k~#B') 



^(fc^#B(J! ^ ,^(Bf,) [,.(B[,J 



beB 



(3.4) 

(3.5) 
(3.6) 



beS:b7^SJ 



fc! 



(k-#B') 
PuiB,B';u) 



(fc-#B(J! 

11 ri [[oABib 



beB 



beB 



m 



(fe-#B|b)! 



out of the 



Here, p.3p is obtained from p.2p by factoring /fc_ig/v Hbga fcT 

sum and using observations (i), (ii) and (iii). In p.4p . we use the fact that for any 
B" G 2?2, ^iB* = ^0|_B*i 9'iid there arc =i^i?' — ifB'.g elements in 2?2 according to 
(ii). Line p. 51) follows by observing that each B" e 2?i corresponds to an element 
of -Ci^Si.#Bi+i(^|Si) and B'^^^g, is the element of £'#b^^#b^+i(B[5J obtained by 
inserting {n+ 1} as a singleton in B'.^ . Finally, p.6p follows from p.Sp by consistency 
of the paintbox process. This completes the proof. 

The following result is immediate by finite exchangeability and consistency of (j3.ip for 
every n and Kolmogorov's extension theorem (theorem 36.1, j6j). 
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Theorem 3.1. There exists a transition probability p{-, •;!/) on ('P''^'-',cr (lJ„'Pr i ) ) 
whose finite- dimensional restrictions are given by p.ip . 

We call the discrete-time process governed by p{-, ■; ly) the gi^-Markov chain with state 
space ■p'-*^^ 

3.1. Equilibrium measure 

From dnH), it is clear that for each n,k e N and B,B' e P[^p Pn{B,B';iy) 
is strictly positive provided v is such that ^{s) > for some s = (si,...,Sfc) G 

(k) 

Vm with Sfc > 0. Under this condition, the finite-dimensional chains are aperiodic 

(k) 

and irreducible on 7^r„i and, therefore, have a unique stationary distribution. In 
fact, the finite-dimensional chains based on v are aperiodic and irreducible provided 
v is not degenerate at (1,0,..., 0) G Vm . The existence of a unique stationary 
distribution for each n implies that there is a unique stationary probability measure 
on (v^''\<j(\Jn'P[n])) iorp{-,-,i^) from theorem [SH 

Proposition 3.3. Let v be a measure on Vm such that v is non- degenerate at (1, 0, . . . , 0) G 
Vm ■ Then there exists a unique stationary distribution 9n{-\v) for Pn{-, -'ji^) for each 
n>l. 

Proof. Fix n e N and let i^ be any measure on Vm other than that which puts 
unit mass at (1,0,..., 0). For B = {Bi, . . . ,Bm) £ ^Li i P-ip gives the transition 
probability 

u ™ 1 

and Qu{Bi) — Qv{[#Bi\) > for each i — 1, . . . ,m. Hence, pn{B,B;i/) > for every 

(k) 

B G V^^^ and the chain is aperiodic. 

To see that the chain is irreducible, let B, B' G "Pr i a-nd let 1„ denote the one block 
partition of [n]. Then 



On{B,l^-v) ^ k\{\e,{[#b]) > Q 



k' 

beB 



and, since u is not degenerate at (1,0,..., 0), there exists a path 1„ k->- B' by recursively 
partitioning 1„ until it coincides with B' . For instance, let B' := {B[, . . . , i?^„) G V^'^' . 
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One such path from 1„ to _B' is 

m m 

1=2 i=3 

which has positive probabihty for any non-degenerate ly. Hence p„(-, -'jv) is irreducible, 
which estabhshes the existence of a unique stationary distribution for each n. 

Theorem 3.2. Let v he a measure on Vm such that t^((l,0, . . . ,0)) < 1. Then there 
exists a unique stationary probability measure 0{-\ v) for the g,y-Markov chain on V^^' . 

Proof. For v satisfying this condition, proposition 13.31 shows that a stationary dis- 
tribution exists for each n > 1. Let {6n{-\v),n > 1) be tire collection of stationary 
distributions for the finite-dimensional transition probabilities (p„(-, •; i/),n > 1). We 
now show that the On are consistent and finitely exchangeable for each n. 

Fix n G N and let B e V, , . Then stationarity of 6'„(-; v) implies 



Y, en{B'-u)pn{B',B-v)^e^{B-u). 



B'ev^^^l 



Now write 6'„(-) = 0„(-; v) and Pn{-, •) = Pn{-, S i^) for convenience and let B' G "Pr 
We have 



^ en+i{B") = E E On+i{B*)Pn+i{B*,B") 

"^ V ' 

(e„+in-'„,i)(B') 



(k) 



/ , dn+l{B* 






E Pn+l{B\B") 



= E E en+i{B*)[pn{B,B')] 

= E Pn(B,B') E (^n+l{B*) 

= E Pn{B,B'){9n+iD-X+,){B). 

So we have that 9n+iD^ ^^^-^ is stationary for p„ which implies that 0„ = 0ri+i£',^„+i 
by uniqueness and 0„ is consistent for each n. 
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Let CT be a permutation of [n]. Then for any B,B' G ^Li j Pn{<^{B),a{B')) — 
Pn{B, B') by exchangeability oi Pn- It fohows that 9n is finitely exchangeable for each 
n since 

Y. e,MB))Pn{'j{B),a{B')) = 9.^{a{B')) 

by stationarity, and Pn{(j{B),a{B')) ^ pn{B, B') implies that 

J2 e^{a{B))pn{B,B') = en{cj{B')). 

Hence, 0„ o cr is stationary for p„ and 0„ = 0„ o cr by uniqueness. 

Kolmogorov consistency implies that there exists a unique exchangeable stationary 
probability measure 9 on V'^^^ whose restriction to [n] is 0„ for each n G N. This 
completes the proof. 

4. The ^^-Markov process in continuous time 

Let A > 0, i^ be a measure on Vm and for each n e N define Markovian infinitesimal 
jump rates for a Markov process on V}-! by 

f XpJB,B';v), B^B' 
I 0, o.w. 

where p„ is as in p.ip . The infinitesimal generator, QJ^, of the process on V/} governed 
by qn has entries 

f pJB,B';u), B^B' 
Q-^{B^B')=\x{ ^"^ ' ' ^' ^ (4.2) 

[ Pn{B,B-y)^l, B^B'. 
We now construct a Markov process -B := {B{t),t > 0) in continuous time whose 
finite-dimensional transition rates are given by ()4.ip . 

Definition 4.1. A process B :— {B{t),t > 0) on V^^' is a gii^-Markov process if, for 
each n e N, i?| [„] is a Markov process on P, , with Q-matrix Q'^^ as in (14.21) . 

A process on py^> whose finite-dimensional restrictions are governed by Q'^ can be 
constructed according to the matrix construction from section 3 by permitting only 
transitions B ^^ B' for B' =/= B, where B, B' G ^r„i : and adding a hold time which is 
exponentially distributed with mean — 1/(35^(_B, _B). 
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Proposition 4.1. For a measure v onVm , let {Qjjnefi ^£ ^^2 collection ojQ-matrices 
in (|4.2p . For every n g N, the entries of Q'^ satisfy (|2.ip . 

Proo/. Fix n G N and let B, B' e T'j*;^^^ such that B ^ B' . Then 

Q;;(i?,i?')= E Qr.+i(s*,i?") 

for ah B* e D^j^j^i{B) by the consistency of p„ from proposition 13.21 
For B' ^ B and B^ e D-},^^^{B), we have 

J2 Qn+l{B.,B") = 

Q"„+i{B„B,)+ Y. Qn+iiB„B") 



Pn+i{B,,B,;u)^l+ Yl Pu+i{B*,B";u) 

B" eD-'^„+l(B)\{B,} 



X J2 Pn+i(B„B'»-l 

= \{pn{B,B-u)~l) 

= Qn{B,B). 

Theorem 4.1. For each measure v on Vni , there exists a Markov process {B(t),t > 0) 
on "pC^) which has finite- dimensional transition rates given in (|4.ip . 

Proof Let z^ be a measure on Vm and (_B|[„](i),t > 0)„gN be the coUection of 
restrictions of a Pi^-Markov process with consistent Q-matrices {Qn)neN as in (|4.2p . 
For each n, QJ^ is finitely exchangeable and consistent with Qn+i by proposition 14.11 
which is sufficient for -B|[„i to be consistent with Bu^^ii for every n. Kolmogorov's 
extension theorem implies that there exist transition rates, Q'^ , on P*^*^^ such that for 
every B,B' eVl^l 

QUB. B') = Q-(B,, {B" e PC-^) : i?[[„] - i?'}), 

for every B, e {B" e p('=' : B[|„] = B}. 

Finally, for every B e v\^l, Ql{B, v\^l \{B}) = A(l -p„(B, B; v)) < 00 so that the 
sample paths of B|[„] are cadlag for every n, which implies that B is cadlag. 
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Corollary 4.1. Forv which satisfies the condition of theorem \8.SX the continuous-time 
process B := [B(t),t > 0) with finite- dimensional rates qn{-,-',i^) in (j4.ip has unique 
stationary distribution 0{-;i') from theorem \3.S\ 



Proof. For each n G N, let 0„(-;;y) be the unique finite-dimensional stationary 
distribution of p„(-,-;z^) from (|3.1I) . It is easy to verify that for each n G N, OJ'j :~ 
{e.n{B-v),B ev\^\) satisfies 

(e:;)*g;; = o, 

which establishes that BJ'j is stationary for QJ'j for every n. The rest follows by theorem 



4.1. Poissonian construction 

From the matrix construction at the beginning of section |31 a consistent family of 
finite-dimensional Markov processes with transition rates as in (|4.ip can be constructed 
by a Poisson point process on ]R+ x ni=i ^'''^'' ^^ follows. Let P = {(t, Ci, . . . , Cfe)} C 
R+ X HzLi ^^'^■' be a Poisson point process with intensity measure dt ® Xgl, ' for some 



,(fe) 



(fe) 



Q^ on 



measure i/ on Via and A > 0, where pi/ is the product measure g^, • 

Construct an exchangeable process B :— {B(t),t > 0) on V^''^ by taking n G V^'^^ 
to be some exchangeable random partition and setting -B(O) = tt. 
For each n G N, put i?|[„](0) = 7r|[„] and 

• if i is not an atom time for P, then _B|r„i(t) = i?|r„i(i— ); 

• if f is an atom time for P so that (i, Ci, . . . , Cfe) G P, then, independently 
of {B{s)^s < t) and (t, Ci, . . . , Cfe) generate cri,...,crfc i-i-d. uniform random 
permutations of [k] and construct i?' from the set partition induced by the column 
totals (C.i, . . . ,C.fc) of 



C.i 



c- 



Bi 

B2 



I C'i,ai(i) n Bi C'i_CTi(2) n Bi 

C2,(J2(1) n i?2 (^2, 0-2(2) n i?2 



B 



C.fe 

Ci,o-i(fc) ni?i \ 



G 



2,o-2(fe) 



nB. 



fc \Ck,ak{l) n -Bfc Ck,ak{2) n i?A: 



Cfc 



fc,crfc(fc) 



ns, 



snc" 



.; 
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where (i?i, . . . , Bk) are the blocks of S = ^\[n] (t^) hsted in order of their least 
element, with k — ^B empty sets at the end of the list. 

• if S' ^ B, then S|[„](t) = B'; 

• ifS' = B, B|[„](f) = S|[„](i-). 

Proposition 4.2. The above process B is a Markov process on "pC^' with transition 
matrix Q" defined by theorem\jA\ 



Proof. This is clear from the consistency of both the paintbox process g^ and the 
(55^-matrices for every n and the fact that, by this construction, for any n such that 
B\[n]it) = TT then B[n]\[m]{t) = -Dm,„(7r) for all m < n and B\[p]{t) e £'^,p(7r) for all 
p > n. 

Let Ft be the semi-group of a g^-Markov process B{-), i.e. for any continuous (p : 

P((^(7r) -.^EMBit)), 
the expectation of Lp{B{t)) given B{0) = n. 
Corollary 4.2. A g^-Markov process has the Feller property, i.e. 

• for each continuous function ip : V^^' — )■ M, for each tt ^ V one has 

\imFtip{n) = ip{n), 

• for a/H > 0, TT M- Pf(p(7r) is continuous. 

Proof. The proof follows the same program as the proof of corollary 6 in [5] . 

Let Cf -.^ {/ : V^''^ -^ R : 3n e N s.t. 7r|[„] = 7r[[,^j => /(tt) = /(tt')} be a set of 
functions which is dense in the space of continuous functions from 'p(^) -^ M. It is clear 
that for g G Cf, linif^o FtgiTr) — .g(7r) since the first jump-time of B{-) is an exponential 
variable with finite mean. The first point follows for all continuous functions P^*^' -^ M 
by denseness of Cf . 

For the second point, let n, tt' G 'P'-'"'-' such that d{TT, tt') < l/n and use the same Pois- 
son point process P to construct two gij^-Markov processes, B{-) and B'{-), with starting 
points TT and tt' respectively. By the construction, i3|[„] — B',.-, and d{B{t),B'{t)) < 
l/n for all i > 0. It follows that for any continuous g, tt i— )■ Pj(7(7r) is continuous. 
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This allows us to characterize the gi^-Markov process in terms of its infinitesimal 
generator. Let B := {B{t),t > 0) be the gii^-Markov process on 'P'-'"'-' with transition 
rates characterized by {qn)neN as in (|4.ip . The infinitesimal generator, A., of B is given 

by 






for every f £ Cf. 



5. Asymptotic frequencies 
Definition 5.1. A subset A C N is said to have asymptotic frequency A if 

A := lim *{^<n■.^eA} ^^_^^ 

n— fC30 71 

exists, and a random partition B := (i?i, i?2, • • •) £ 7^ is said to have asymptotic 
frequencies if each block of B has asymptotic frequency almost surely. 

Adopting the notation of Berestycki [5], let A{B) = (||_Bi||, ||i?2||, ••■)"'' be the decreasing 
arrangement of asymptotic frequencies of a partition B = {Bi,B2, ■ ■ ■) £ P which 
possesses asymptotic frequencies, some of which could be 0. 

According to Kingman's representation theorem (theorem 2.2, [17 J any exchange- 
able random partition of N possesses asymptotic frequencies. Intuitively, this is a 
consequence of generating an exchangeable random partition of N by the paintbox 
process. 

The process described in section [3] only assigns positive probability to transitions 
involving two partitions with at most k blocks. From the Poissonian construction of 
the transition rates in section HTTj it is evident that the states of _B = {B{t),t > 0) will 
have at most k blocks almost surely. Moreover, the description of the transition rates 
in terms of the paintbox process allows us to describe the associated measure- valued 
process of B :— {B{t),t > 0) characterized by A and ly. 

5.1. Poissonian construction 

Consider the following Poissonian construction of a measure- valued process X := 
{X{t), i > 0) on V^n^. For any fc e N, A > and f as above, let P' = {(t. Pi,..., PI,)} C 
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M+ X Y[i=i ^m tie a Poisson point process with intensity measure dt (8) \i'''^\ where 
j/*^'') is the product measure v ® . . .®v on Y\^^l Via . 

Construct a process X :— {X{t),t > 0) on Vm by generating po from some 
probabihty distribution on Vm ■ Put X{Q) = po and 

• if i is not an atom time for P', then X{t) = X{t—); 

• if t is an atom time for P' so that {t, P[, . . . , P^) G P' , with Pj = (Pf , . . . , Pf ) 
for each j — l,...,fc, and X{t~) = (xi,...,a;fe) G Pm , then, independently 
of (X(s), s < i) and (i, P{, . . . , P^), generate cti, . . . , crfe i.i.d. uniform random 
permutations of [k] and construct X{t) from the marginal column totals of 



Pi Pi ... Pi 



Xi 
X2 






Xk \xkP^^(^i) XkP„^(2) ■■■ ^kP„^(k) J 

i.e. put X{t) = (Pi,Pi, ..., P^)J- := (Eti x^P^^^yl < 3 < k)\ 

Theorem 5.1. Let X := {X{t),t > 0) be the process constructed above. Then X =c 
A(P) where B :— {B{t),t > 0) is the g^j-Markov process from theorem \^.l\ 

Proof. Fix fc G N and let i^(-) be a measure on Pm . 

In the description of the sample paths of B in section |4l note that generating 
(Ci,...,Cfe) ^ Qi is equivalent to first generating Si ^ v independently for each 
i — 1, . . . , fc, then generating random partitions Ci by sampling from si for each i = 
1, . . . , fc. Finally, B'^ is set equal to the marginal total of column i of the matrix BOC^ , 
where a :— (cti, . . . ,ak) is an i.i.d. collection of uniform random permutations of [fc]. 
Hence, we can couple the two processes X and B together using the Poisson point 
process P' described above. 

Let X evolve according to the Poisson point process P' on M+ x Y[i=i P^ ^^ 
described above. Let B evolve by the modification that if t is an atom time of P' then 
we obtain partitions (Ci,...,Cfc) by sampling X^ :— (X{,X|,...) i.i.d. from P/ for 
each i — 1, . . . , fc, i.e. 

p{xi = j\Pl)^p;, 
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and defining tlie bloclcs of Ci as tiie equivalence classes of X^ . Constructed in this way, 
||C,j||= Pj almost surely for each i,j = l,...,k and (Ci, . . . ,Ck) ~ gl ■ 

After obtaining the Ci, generate, independently of i?, Ci, . . . , Cfc, P', i.i.d. uniform 
permutations cti , . . . , crfc of [k] and proceed as in the construction of section 14.11 where 
B,Ci, . . . ,Ck are arranged in the matrix B D C^ and the blocks of B' are obtained as 
the marginal column totals of _B n C°". The (i, j)th entry of i? n C^ is Ci^„.(^j) D Bi for 
which we have \\Ci^c,,(]) nBi||= ||Ci,^,o)||||Si||= x^P^^^^^-^ a.s. 

By this construction, B{t) is constructed according to a Poisson point process with 
the same law as that described in section [4.11 and B{t) possesses ranked asymptotic 
frequencies which correspond to X{t) almost surely for all t > 0. 

Corollary 5.1. X{t) :— {A{B{t)),t > 0) exists almost surely. 

5.2. Equilibrium measure 

Just as the process {B{t),t > 0) on V'^^^ converges to a stationary distribution, so 
does its associated measure-valued process {X{t),t > 0) from section [STTl 



Theorem 5.2. The associated measure-valued process X for a g^-Markov process with 
unique stationary measure 0{-; v) has equilibrium measure 9{-; v), the distribution of the 
ranked frequencies of a 9{-; i^) -partition. 

Proof. Proposition 1.4 in [5 states that if a sequence of exchangeable random parti- 
tions converges in law on V to tToo then its sequence of ranked asymptotic frequencies 
converges in law to ItTooI"''. Hence, from corollarv l4.1l we have that X has equilibrium 
distribution given by the ranked asymptotic frequencies of a 9{-; i/)-partition. 

6. The (a, fe)-Markov process 

Pitman |17) discusses a two-parameter family of infinitely exchangeable random 
partitions called the (a, 0) process which has finite-dimensional distributions 

p„iB;a,e):^^^Ml^l[-i-a)^*'', (6.1) 

beB 

for (a, 9) satisfying either 

• a = — K < and 9 — mn for some rn = 1, 2, . . . , or 
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• < a < 1 and 9 > -a. 
For fc G N and a > 0, a {—a, ka) partition has finite-dimensional distributions 

^"(^' "' ''^ - [k-mi nka+n)/nka) (^■2) 

(k) 

whose support is V^J. 

The distribution of the ranked asymptotic frequencies of an (a, 9) partition is called 
the Poisson-Dirichlct distribution with parameter (a,^), written PD(q!, 0). 

For notational convenience, introduce the a-permanent |14] of an n x n matrix K , 

n 

aeSn i=l 

where #cr is the number of cycles of the permutation a, and note that when B G Vin] 
is regarded as a matrix, 

per„ B=l[ per, i?|b = [] r(a + #6)/r(a), (6.3) 

b£B b£B 

which allows us to write (16.21) as 



where (^)'^" = P{(3 + 1) ■ ■ ■ (^ + n - 1). 

We now consider a specific sub-family of reversible £i,y-Markov processes for which 
the transition probabilities can be written down explicitly. For k gN and a > 0, let v 
be the PD(— a/fc, a) distribution on 'Pm and define transition probabilities according 
to the matrix construction based on i' as in section [31 We call this process the (a, k)- 
Markov process. 

Proposition 6.1. The (a,k)-Markov process has finite- dimensional transition proba- 
bilities 

Pn{B,B;a,k) = — — -7^777 M ^^ ^, . „ , wt^^ n (6-5 

(k ~ #B')\ ^A^A r(a + #6)/r(a) 

k\ per ^/, (BAB') 

(6.6) 



{k-#B')\ per^B 
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Proof. Theorem 3.2 and definition 3.3 from [T7] sfiows tlrat tlie distribution of _B ^ 
Qu where v = PD(— a/fc,Q) is 

Combining this and p.l|) yields (|6.5p : (16.61) follows from (|6.3p . 

Proposition 6.2. For eac/i (a, /c) G M+ x N and n G N, Pn{-,-;a,k) defined in 
vrovosition \6.1\ is reversible with respect to (j6.2p with parameter (a, fc). 

Proof. Let p„(-; a, fc) be the distribution with parameter (a, fc) defined in (|6.2p . and 
p„(-, •; a, k) be as defined in (|6.5p . For any i?, i?' G 7^r„i , it is immediate that 

Pn{B\ a, k)pn{B, B'; a, k) = p„{B'; a, k)pn{B', B; a, fc), (6.7) 

which establishes reversibility. 

Bertoin [4] discusses some reversible EFC processes which have PD(q;, 9) distribution as 
their equilibrium measure, for < a < 1 and > —a. Here we have shown reversibility 
with respect to PD(q;, 9) for a < and 9 — —ma for m e N. 

The construction of the continuous-time process is a special case of the procedure in 
section[31 The measure- valued process (X(t), t > 0) based on the (a, fc)-Markov process 
has unique stationary measure PD(— a, ka), the distribution of the ranked frequencies 
of a partition with finite-dimensional distributions as in (16.21) with parameter (a, fc). 

7. Discussion 

The paths of the pj,-Markov process discussed above are confined to 'P^''\ Unlike 
the EFC-process [5] , which has a natural interpretation as a model in certain physical 
sciences, the g^-Markov process has no clear interpretation as a physical model. How- 
ever, the matrix construction introduced in section [3] leads to transition rates which 
admit a closed form expression in the case of the {a, fc)-Markov process. 

The {a, fc) class of models could be useful as a statistical model for relationships 
among statistical units which are known to fall into one of fc classes. In statistical 
work, it is important that any observation has positive probability under the specified 
model. The (a, fc)-process assigns positive probability to all possible transitions and so 
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any observed sequence of partitions in V^J will have positive probability for any choice 
of a > 0. In addition, the model is exchangeable, consistent and reversible, particularly 
attractive mathematical properties which could have a natural interpretation in certain 
applications. Future work is intended to explore applications for this model, as well as 
develop some of the tools necessary for its use in statistical inference. 
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